Search for: All records

Creators/Authors contains: "Guo, Lei"


  1. Free, publicly-accessible full text available June 1, 2026
  2. ABSTRACT The abundance of various cell types can vary significantly among patients with varying phenotypes and even those with the same phenotype. Recent scientific advancements provide mounting evidence that other clinical variables, such as age, gender, and lifestyle habits, can also influence the abundance of certain cell types. However, current methods for integrating single-cell-level omics data with clinical variables are inadequate. In this study, we propose a regularized Bayesian Dirichlet-multinomial regression framework to investigate the relationship between single-cell RNA sequencing data and patient-level clinical data. Additionally, the model employs a novel hierarchical tree structure to identify such relationships at different cell-type levels. Our model successfully uncovers significant associations between specific cell types and clinical variables across three distinct diseases: pulmonary fibrosis, COVID-19, and non-small cell lung cancer. This integrative analysis provides biological insights and could potentially inform clinical interventions for various diseases. 
  3. Abstract Fertilization is a fundamental process that triggers seed and fruit development, but the molecular mechanisms underlying fertilization-induced seed development are poorly understood. Previous research has established AGAMOUS-LIKE62 (AGL62) activation and auxin biosynthesis in the endosperm as key events following fertilization in Arabidopsis (Arabidopsis thaliana) and wild strawberry (Fragaria vesca). To test the hypothesis that epigenetic mechanisms are critical in mediating the effect of fertilization on the activation of AGL62 and auxin biosynthesis in the endosperm, we first identified and analyzed imprinted genes from the endosperm of wild strawberries. We isolated endosperm tissues from F1 seeds of 2 wild strawberry F. vesca subspecies, generated endosperm-enriched transcriptomes, and identified candidate maternally expressed and paternally expressed genes (MEGs and PEGs). Through bioinformatic analyses, we identified 4 imprinted genes that may be involved in regulating the expression of FveAGL62 and auxin biosynthesis genes. We conducted functional analysis of a maternally expressed gene, FveMYB98, through CRISPR knockout and overexpression in transgenic strawberries as well as analysis in heterologous systems. FveMYB98 directly repressed FveAGL62 in stage 3 endosperm, which likely serves to limit auxin synthesis and endosperm proliferation. These results provide an inroad into the regulation of early-stage seed development by imprinted genes in strawberries, suggest the potential function of imprinted genes in parental conflict, and identify FveMYB98 as a regulator of a key transition point in endosperm development.
  4. ABSTRACT Recent breakthroughs in spatially resolved transcriptomics (SRT) technologies have enabled comprehensive molecular characterization at the spot or cellular level while preserving spatial information. Cells are the fundamental building blocks of tissues, organized into distinct yet connected components. Although many non-spatial and spatial clustering approaches have been used to partition the entire region into mutually exclusive spatial domains based on the SRT high-dimensional molecular profile, most require an ad hoc selection of less interpretable dimensional-reduction techniques. To overcome this challenge, we propose a zero-inflated negative binomial mixture model to cluster spots or cells based on their molecular profiles. To increase interpretability, we employ a feature selection mechanism to provide a low-dimensional summary of the SRT molecular profile in terms of discriminating genes that shed light on the clustering result. We further incorporate the SRT geospatial profile via a Markov random field prior. We demonstrate how this joint modeling strategy improves clustering accuracy, compared with alternative state-of-the-art approaches, through simulation studies and 3 real data applications. 
  5. Abstract Current clustering analysis of spatial transcriptomics data primarily relies on molecular information and fails to fully exploit the morphological features present in histology images, leading to compromised accuracy and interpretability. To overcome these limitations, we have developed a multi-stage statistical method called iIMPACT. It identifies and defines histology-based spatial domains based on AI-reconstructed histology images and spatial context of gene expression measurements, and detects domain-specific differentially expressed genes. Through multiple case studies, we demonstrate iIMPACT outperforms existing methods in accuracy and interpretability and provides insights into the cellular spatial organization and landscape of functional genes within spatial transcriptomics data. 
  6. The remarkable diversity of leaf forms allows plants to adapt to their living environment. In general, leaf diversity is shaped by leaf complexity (compound or simple) and leaf margin pattern (entire, serrated, or lobed). Prior studies in multiple species have uncovered a conserved module of CUC2-auxin that regulates both leaf complexity and margin serration. How this module is regulated in different species to contribute to the species-specific leaf form is unclear. Furthermore, the mechanistic connection between leaf complexity and leaf serration regulation is not well studied. Strawberry has trifoliate compound leaves with serrations at the margin. In the wild strawberry Fragaria vesca, a mutant named salad was isolated that showed deeper leaf serrations but normal leaf complexity. SALAD encodes a single-Myb domain protein and is expressed at the leaf margin. Genetic analysis showed that cuc2a is epistatic to salad, indicating that SALAD normally limits leaf serration depth by repressing CUC2a expression. When both Arabidopsis homologs of SALAD were knocked out, deeper serrations were observed in Arabidopsis rosette leaves, supporting a conserved function of SALAD in leaf serration regulation. We incorporated the analysis of a third strawberry mutant simple leaf 1 (sl1) with reduced leaf complexity but normal leaf serration. We showed that SL1 and SALAD independently regulate CUC2a at different stages of leaf development to, respectively, regulate leaf complexity and leaf serration. Our results provide a clear and simple mechanism of how leaf complexity and leaf serration are coordinately as well as independently regulated to achieve diverse leaf forms. 
  7. Abstract The R2R3-MYB transcription factor FveMYB10 is a major regulator of anthocyanin pigmentation in red strawberry fruits. fvemyb10 loss-of-function mutants form yellow fruits but still accumulate purple anthocyanins in the petioles, suggesting that anthocyanin biosynthesis is under distinct regulation in fruits and petioles. We identified a green petioles (gp)-1 mutant from chemical mutagenesis in the diploid wild strawberry Fragaria vesca that lacks anthocyanins in petioles. Using mapping-by-sequencing and transient functional assays, we confirmed that the causative mutation resides in a FveMYB10-Like (FveMYB10L) gene and that FveMYB10 and FveMYB10L function independently in the fruit and petiole, respectively. In addition to their tissue-specific regulation, FveMYB10 and FveMYB10L respond differently to changes in light quality, produce distinct anthocyanin compositions, and preferentially activate different downstream anthocyanin biosynthesis genes in their respective tissues. This work identifies a new regulator of anthocyanin synthesis and demonstrates that two paralogous MYB genes with specialized functions enable tissue-specific regulation of anthocyanin biosynthesis in fruit and petiole tissues.